Tamil Summary Generation for a Cricket Match
نویسنده
چکیده
Cricket is one of the most followed sports in the Indian subcontinent. There is a wide requirement for natural language descriptions, which summarize a cricket match effectively. The process of generating match summaries from statistical data is a manual process. The objective of this paper is to propose a framework for automatic analysis and summary generation for a cricket match in Tamil, with the scorecard of the match as the input. Data analytics is performed on the statistical match data, to mine all frequently occurring patterns. The paper proposes a parameter called Interestingness, which quantifies the interestingness of the match. The paper also proposes a customization model for the summary. We propose an evaluation parameter called humanness, which quantifies the similarity between the output and a manually written summary. Discussing the results and analyzing the summaries generated for matches based on scorecards, this paper concludes with proposing some extensions for future developments.
منابع مشابه
Impact of Power Play Overs on the Outcome of Twenty20 Cricket Match
This study attempts to find if better performance in power play leads a team to victory in a Twenty20 match. Based on the methodology devised to do so, the study tries to measure the performance of both the teams during power play overs in terms of batting and bowling. The developed measure is called ‘Prod’ which is a product of the difference of batting and bowling performance of t...
متن کاملAn alternate approach towards meaningful lyric generation in Tamil
This paper presents our on-going work to improve the lyric generation component of the Automatic Lyric Generation system for the Tamil Language. An earlier version of the system used an n-gram based model to generate lyrics that match the given melody. This paper identifies some of the deficiencies in the melody analysis and text generation components of the earlier system and explains the new ...
متن کاملLinking Event Mentions From Cricket Match Reports to Commentaries
We focus on the problem of linking event mentions in cricket match reports to instances from temporal commentary data. The problem is challenging because depending on the event type, event mentions could be linked to a single data instance, or to a set of instances. The complexity of the natural language in the reports along with a lack of canonical names or verbose descriptions of the data ins...
متن کاملTemplate Based Multilingual Summary Generation
Summarization of large text documents becomes an essential task in many Natural Language processing (NLP) applications. Certain NLP applications deal with domain specific text documents and demand for a domain specific summary. When the essential facts are extracted specific to the domain, the summary proves to be more efficient. The proposed system builds a bilingual summary for an Information...
متن کاملAuto-play: A Data Mining Approach to ODI Cricket Simulation and Prediction
Cricket is a popular sport played by 16 countries, is the second most watched sport in the world after soccer, and enjoys a multi-million dollar industry. There is tremendous interest in simulating cricket and more importantly in predicting the outcome of games, particularly in their one-day international format. The complex rules governing the game, along with the numerous natural parameters a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011